Query Steering for Interactive Data Exploration

نویسندگان

  • Ugur Çetintemel
  • Mitch Cherniack
  • Justin DeBrabant
  • Yanlei Diao
  • Kyriaki Dimitriadou
  • Alexander Kalinin
  • Olga Papaemmanouil
  • Stanley B. Zdonik
چکیده

Traditional DBSMs are suited for applications in which the structure, meaning and contents of the database, as well as the questions to be asked are already well understood. There is, however, a class of applications that we will collectively refer to as Interactive Data Exploration (IDE) applications, in which this is not the case. IDE is a key ingredient of a diverse set of discovery-oriented applications we are dealing with, including ones from scientific computing, financial analysis, evidence-based medicine, and genomics. The need for effective IDE will only increase as data are being collected at an unprecedented rate. IDE is fundamentally a multi-step, non-linear process with imprecise end-goals. For example, data-driven scientific discovery through IDE often requires non-expert users to iteratively interact with the system to make sense of and to identify interesting patterns and relationships in large, amorphous data sets. To make the most of the increasingly available complex and big data sets, users would need an “expert assistant” who would be able to effectively and efficiently guide them through the data space. Having a human assistant is not only expensive but also unrealistic. Thus, it is essential that we automate this task. We propose database systems be augmented with an automated “database navigator” (DBNav) service that assists as a “tour guide” to facilitate IDE. Just like a car navigation system that offers advice on the routes to be taken and display points of interest, DBNav would similarly steer the user towards interesting “trajectories” through the data, while highlighting relevant features. Like any good tour guide, DBNav should consider many kinds of information; in particular, it should be sensitive to a user’s goals and interests, as well as common navigation patterns that applications exhibit. We sketch a general data navigation framework and discuss some specific components and approaches that we believe belong to any such system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AIDE: An Automated Sample-based Approach for Interactive Data Exploration

In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. T...

متن کامل

Computational Steering

Computational steering is the online management of the execution of an application and its resources for the purpose of either performance improvement or application exploration. Generally, visualizations are used to provide the user with insight into the state and behavior of the underlying system, and as a feedback mechanism, enabling users to gauge the effectiveness of these parameter adjust...

متن کامل

GraphVista: Interactive Exploration Of Large Graphs

The potential to gain business insights from graph-structured data through graph analytics is increasingly attracting companies from a variety of industries, ranging from web companies to traditional enterprise businesses. To analyze a graph, a user often executes isolated graph queries using a dedicated interface—a procedural graph programming interface or a declarative graph query language. T...

متن کامل

A Progressive Query Materialization for Interactive Data Exploration

Data analysis is task of competently digging out data insights even if user is uncertain but database systems unable to meet. User’s Iterative interactions with system might be an alternative. Interactive data exploration (IDE) is one such system, supports exploration by incorporating user intention. IDE is key ingredient of many discovery applications, such as scientific computing, financial a...

متن کامل

A Compilation Technique for Interactive Ontology-mediated Data Exploration

We make a step towards the interactive exploration of data in the ontology-mediated setting, presenting a technique to construct an offline compilation that allows us to answer efficiently different related queries in online phase, without the need to access again the original data. We also propose algorithms to construct relevant variations of a given query that, for example, reduce or increas...

متن کامل

Immersive Visualization Using AVS/Express

To achieve benefit and value from high performance computing (HPC), visualization is employed for a direct understanding of computational results. The more closely coupled the visualization system can be to the HPC data source, the better the chance of fostering discovery and insight. A software architecture is described, with discussion of the dynamic object manager allowing scalable high-perf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013